Distributed memory code generation for mixed Irregular/Regular computations

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Code Generation for Distributed Systems

described graphically, interest in code generators which create applications based on abstract system descriptions, is high. An indicator for this are the many commercial products. This paper aims at exploring the theoretical foundations of code generation for distributed systems with regard to the data structures and template language syntax. Several existing approaches are analysed and a new ...

متن کامل

Communication Lower Bounds for Distributed-Memory Computations

In this paper we propose a new approach to the study of the communication requirements of distributed computations, which advocates for the removal of the restrictive assumptions under which earlier results were derived. We illustrate our approach by giving tight lower bounds on the communication complexity required to solve several computational problems in a distributedmemory parallel machine...

متن کامل

Code Generation of Optimized Distributed-Memory Dense Linear Algebra Kernels

Design by Transformation (DxT) is an approach to software development that encodes domain-specific programs as graphs and expert design knowledge as graph transformations. The goal of DxT is to mechanize the generation of highly optimized code. This paper demonstrates how DxT can be used to transform sequential specifications of an important set of Dense Linear Algebra (DLA) kernels, the level-...

متن کامل

Automatic Generation of Parallel Code for Hessian Computations

Given a program to compute some function, automatic differentiation can be used to mechanically generate another program capable of evaluating firstand higher-order derivatives of that function. A new strategy for the computation of Hessians by automatic differentiation is proposed where the generated code is automatically parallelized using OpenMP. The approach is applied to compute second-ord...

متن کامل

Multiclass classification of distributed memory parallel computations

High Performance Computing (HPC) is a field concerned with solving large-scale problems in science and engineering. However, the computational infrastructure of HPC systems can also be misused as demonstrated by the recent commoditization of cloud computing resources on the black market. As a first step towards addressing this, we introduce a machine learning approach for classifying distribute...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM SIGPLAN Notices

سال: 2015

ISSN: 0362-1340,1558-1160

DOI: 10.1145/2858788.2688515